Speech processing system and speech processing method

ABSTRACT

Speech processing system includes a first sound receiving device, a second sound receiving device, a main controller, and an audio processor. First and second sound receiving devices are configured to generate a main voice signal or a secondary voice signal. A first sensing device in first sound receiving device and a second sensing device in second sound receiving device are configured to output a first sensing signal or a second sensing signal based on a sensing result. Main controller controls first sound receiving device to generate main voice signal and controls second sound receiving device to generate secondary voice signal when receiving first sensing signal. Main controller controls second sound receiving device to generate main voice signal and controls first sound receiving device to generate secondary voice signal when receiving second sensing signal. Audio processor is configured to process main and secondary voice signals into an output voice signal.

RELATED APPLICATIONS

This application claims priority to Taiwan Application Serial Number108111049, filed Mar. 28, 2019, which is herein incorporated byreference.

BACKGROUND Field of Invention

The present disclosure relates to a speech processing system and aspeech processing method. More particularly, the present disclosurerelates to a speech processing system and a speech processing methodcapable of sensing a user to automatically turn on.

Description of Related Art

Main performances of a sound receiving device, such as frequencyresponse, are determined by sound receiving units. The number of unitsmay be one to several tens, and this number is determined by the soundreceiving method and function of the sound receiving device. The commonsound receiving system includes the sound receiving units plus thesignal amplifier circuit. The sound receiving effect of a soundreceiving device is mainly determined by the selection of electroniccomponents and the modification of back-end software. However, under thesame conditions, it is difficult to create differentiation.

For the foregoing reasons, there is a need to further improve the soundreceiving effect of a sound receiving device, such as a microphone, byproviding a new method.

SUMMARY

A speech processing system is provided. The speech processing systemincludes a first sound receiving device, a second sound receivingdevice, a main controller, and an audio processor. The first soundreceiving device and the second sound receiving device are configured toreceive a main voice signal or a secondary voice signal. A first sensingdevice in the first sound receiving device and a second sensing devicein the second sound receiving device are configured to output a firstsensing signal or a second sensing signal based on a sensing result. Themain controller controls the first sound receiving device to generatethe main voice signal and controls the second sound receiving device togenerate the secondary voice signal when receiving the first sensingsignal. The main controller controls the second sound receiving deviceto generate the main voice signal and controls the first sound receivingdevice to generate the secondary voice signal when receiving the secondsensing signal. The audio processor is configured to process the mainvoice signal and the secondary voice signal into an output voice signal.

The present disclosure provides a speech processing method including thefollowing operations: outputting a first sensing signal or a secondsensing signal through a sensing result of a first sensing device or asecond sensing device; receiving the first sensing signal or the secondsensing signal through a main controller; controlling switching statesof a first switch and a second switch to couple the first soundreceiving device and the second sound receiving device to an audioprocessor; controlling the first sound receiving device to generate amain voice signal and controlling the second sound receiving device togenerate a secondary voice signal when receiving the first sensingsignal; controlling the second sound receiving device to generate themain voice signal and controlling the first sound receiving device togenerate the secondary voice signal when receiving the second sensingsignal; and processing the main voice signal and the secondary voicesignal into an output voice signal through the audio processor.

In summary, the first sensing device or the second sensing device isused to detect whether the user approaches the first sound receivingdevice or the second sound receiving device. When the user approaches,the main controller turns on the corresponding first sound receivingdevice or second sound receiving device based on the position of theuser. Subsequently, the audio processor is used to process the voicesignals received by the first sound receiving device and the secondsound receiving device into the output voice signal.

It is to be understood that both the foregoing general description andthe following detailed description are by examples, and are intended toprovide further explanation of the invention as claimed.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings are included to provide a furtherunderstanding of the invention, and are incorporated in and constitute apart of this specification. The drawings illustrate embodiments of theinvention and, together with the description, serve to explain theprinciples of the invention. In the drawings,

FIG. 1 depicts a simplified functional block diagram of a speechprocessing system according to one embodiment of the present disclosure;

FIG. 2 depicts a functional block diagram of a speech processing systemaccording to one embodiment of the present disclosure;

FIG. 3 depicts a schematic diagram of a configuration of a soundreceiving device according to one embodiment of the present disclosure;

FIG. 4 depicts a flowchart of a speech processing method according toone embodiment of the present disclosure;

FIG. 5 depicts a flowchart of a speech processing method according toanother embodiment of the present disclosure; and

FIG. 6 depicts a flowchart of a speech processing method according tostill another embodiment of the present disclosure.

DESCRIPTION OF THE EMBODIMENTS

It should be understood that the terms, “comprising”, “including”,“having” and the like, used herein are open-ended, that is, includingbut not limited to. It will be understood that, as used herein, thephrase “and/or” includes any and all combinations of one or more of theassociated listed items.

In this document, the term “coupled” may also be termed “electricallycoupled,” and the term “connected” may be termed “electricallyconnected.” “Coupled” and “connected” may also be used to indicate thattwo or more devices cooperate or interact with each other. It will beunderstood that, although the terms “first,” “second,” etc., may be usedherein to describe various devices, they are merely used to distinguishthe devices or operations described with the same technical terms.Unless otherwise specified, these terms do not refer to or imply theorder or priority, nor are they intended to limit the presentdisclosure.

In the following detailed description, for purposes of explanation,numerous specific details are set forth in order to provide a thoroughunderstanding of the disclosed embodiments. It will be apparent,however, that one or more embodiments may be practiced without thesespecific details. In other instances, well-known structures and devicesare schematically depicted in order to simplify the drawings. Whereverpossible, the same reference numbers are used in the drawings and thedescription to refer to the same or like parts.

A description is provided with reference to FIG. 1. FIG. 1 depicts asimplified functional block diagram of a speech processing systemaccording to one embodiment of the present disclosure. A speechprocessing system 100 includes a plurality of sound receiving devices120 a, 120 b, an audio processor 140 a, an audio processor 140 b, anaudio processor 140 c, and a main controller 160. The sound receivingdevice 120 a and the sound receiving device 120 b are communicativelyconnected to the audio processor 140 a, the audio processor 140 b, andthe audio processor 140 c. The audio processor 140 a, the audioprocessor 140 b, and the audio processor 140 c are communicativelyconnected to the main controller 160. To simplify matters, only twosound receiving devices and three audio processors are depicted in FIG.1, but the actual number of the sound receiving devices and the numberof the audio processors are not limited to the above and may be adjusteddepending on practical requirements.

In one embodiment, the speech processing system 100 may be a microphonearray system. The sound receiving device 120 a and the sound receivingdevice 120 b are configured to receive sounds, such as voices, ambientsounds, and the like, to generate voice signals. Each of the soundreceiving device 120 a and the sound receiving device 120 b may be amicrophone or some other device having the function of receiving soundsto generate voice signals. The audio processor 140 a, the audioprocessor 140 b and the audio processor 140 c are configured to processthe voice signals that are generated through the sound receiving device120 a and the sound receiving device 120 b. The main controller 160 isconfigured to receive a signal of a sensing device, and control thesound receiving device 120 a and the sound receiving device 120 bcorrespondingly based on a sensing result of the sensing device. Themain controller 160 can also receive the voice signals processed by theaudio processor 140 a, the audio processor 140 b and the audio processor140 c. Each of the main controller 160, the audio processor 140 a, theaudio processor 140 b, and the audio processor 140 c may be a centralprocessing unit, a microprocessor, or some other device having dataprocessing functions. Operation methods of the speech processing system100 are illustrated as follows by way of examples.

A description is provided with reference to FIG. 2 and FIG. 3. FIG. 2depicts a functional block diagram of a speech processing systemaccording to one embodiment of the present disclosure. FIG. 3 depicts aschematic diagram of a configuration of a sound receiving deviceaccording to one embodiment of the present disclosure. In the presentembodiment, six sound receiving devices and three audio processors aretaken as an example for illustration. The speech processing system 100includes the sound receiving devices 120 a-120 b and sound receivingdevices 120 c-120 f. The sound receiving devices 120 a-120 f includesensing devices 122 a-122 f respectively and are coupled to switches 180a-180 f respectively. To simplify matters, only the sound receivingdevice 120 a, the sound receiving device 120 b, the sensing device 122a, the sensing device 122 b, the switch 180 a, and the switch 180 b aredepicted, as shown in FIG. 2. The sound receiving devices 120 a-120 fand the sensing devices 122 a-122 f are independent devices and havepositional correspondences.

The sensing devices 122 a-122 f are configured to sense an object (suchas a human body) or an environmental factor (such as temperature,distance, or sound volume). The speech processing system 100 determineswhether a user approaches through the sensing devices 122 a-122 fdisposed at different positions. When the user approaches, it might meanthat the user may use one of the sound receiving devices 120 a-120 f tospeak or record. Each of the sensing devices 122 a-122 f may be atemperature sensing device, a distance sensing device, a volume sensingdevice, or other devices having a sensing function, or combinationsthereof.

Each of the switches 180 a-180 f is constituted by a plurality oftransfer switches. The switches 180 a-180 f can be used to switchconnections between the corresponding sound receiving devices 120 a-120f and the audio processors 140 a, 140 b, 140 c through a line ain, aline aref, a line bin, a line bref, a line cin, or a line cref. Not onlydoes the main controller 160 control the sound receiving devices 120a-120 f based on the sensing result of the sensing devices 122 a-122 f,but the main controller 160 can also control the corresponding switches180 a-180 f to switch the connections between the sound receivingdevices 120 a-120 f and the audio processors 140 a, 140 b, 140 c basedon the sensing result of the sensing devices.

A description is provided with reference to FIG. 4. FIG. 4 depicts aflowchart of a speech processing method according to one embodiment ofthe present disclosure. In order to facilitate the understanding of thespeech processing method shown in FIG. 4, the description is alsoprovided with reference to FIG. 2 and FIG. 3. A method 200 includes stepS210, step S220, step S230, step S240, step S250, and step S260. Thoseskilled in the art would understand that the speech processing methodaccording to the embodiment of the present disclosure is not limited tobeing applied to the speech processing system 100 of FIG. 2 and FIG. 3,and is not limited to the sequence of steps of the flowchart shown inFIG. 4. In the present embodiment, an example for illustration is givento take the sound receiving device 120 a as a first sound receivingdevice and the sound receiving device 120 d as a second sound receivingdevice. The sound receiving device 120 a and the sound receiving device120 d can generate a first voice signal and a second voice signal afterreceiving a sound. For example, when the user uses the sound receivingdevice 120 a to start speaking, the sound receiving device 120 areceives the voice spoken by the user, and generates the first voicesignal, which serves as a main voice signal. At this time, the soundreceiving device 120 d receives the environmental sound and generatesthe second voice signal, which serves as a secondary voice signal.Alternatively, when the user uses the sound receiving device 120 a tostart speaking, the sound receiving device 120 a can receive the voicespoken by the user and part of the environmental sound, and generatesthe first voice signal, which serves as the main voice signal. The soundreceiving device 120 d receives the environmental sound and part of thevoice spoken by the user and generates the second voice signal, whichserves as the secondary voice signal.

In the present embodiment, the sensing device 122 a in the soundreceiving device 120 a is a first sensing device. A sensing device 122 din the sound receiving device 120 d is a second sensing device. Theswitch 180 a corresponding to the sound receiving device 120 a is afirst switch. The switch 180 d corresponding to the sound receivingdevice 120 d is a second switch. It is noted that the second soundreceiving device is disposed opposite to the first sound receivingdevice. Therefore, the first sound receiving device and the second soundreceiving device may be illustrated by taking the sound receiving device120 b and the sound receiving device 120 e, or the sound receivingdevice 120 c and the sound receiving device 120 f as an example.

In step S210, a first sensing signal or a second sensing signal isoutput based on a sensing result of the sensing device 122 a and thesensing device 122 d.

In one embodiment, each of the sensing devices 122 a-122 f may be atemperature sensing device. The sensing result may be a temperaturesensing result. For example, in the speech processing system 100, areference temperature (such as 26 degrees Celsius) could be preset. Whena temperature detected by the sensing device is higher than thereference temperature, it may indicate that the user is likely toapproach the sensing device. The main controller 160 turns on the soundreceiving device corresponding to the sensing device. For example, whenthe user approaches the sound receiving device 120 a, at this time thesensing device 122 a detects that the temperature rises and is higherthan the preset reference temperature. The sensing device 122 atransmits the first sensing signal to the main controller 160. Or, thereference temperature could be a reference temperature range, such as 26to 40 degrees Celsius. When the user approaches the sound receivingdevice 120 a, at this time the sensing device 122 a detects that a bodytemperature of the user is within the preset reference temperaturerange. The sensing device 122 a transmits the first sensing signal tothe main controller 160.

In step S220, the first sensing signal or the second sensing signal isreceived by the main controller 160.

If the user is located in front of the sensing device 122 a, in stepS230, the main controller 160 controls switching states of the switch180 a (the first switch) and the switch 180 d (the second switch) toallow the sound receiving device 120 a (the first sound receivingdevice) and the sound receiving device 120 d (the second sound receivingdevice) to be coupled to the audio processor. For example, when the maincontroller 160 receives the first sensing signal, the main controller160 controls the switch 180 a to allow the sound receiving device 120 acorresponding to the switch 180 a to be coupled to the audio processor140 a through the line ain. The main controller 160 also controls theswitch 180 d to allow the sound receiving device 120 d corresponding tothe switch 180 d to be coupled to the audio processor 140 a through theline aref. If the user is located in front of the sensing device 122 d,the main controller 160 controls the switch 180 a to allow the soundreceiving device 120 a corresponding to the switch 180 a to be coupledto the audio processor 140 a through the line aref when the main control160 receives the second sensing signal. The main controller 160 alsocontrols the switch 180 d to allow the sound receiving device 120 dcorresponding to the switch 180 d to be coupled to the audio processor140 a through the line ain.

When the main controller 160 receives the first sensing signal, stepS240 is executed. The main controller 160 controls the sound receivingdevice 120 a (the first sound receiving device) to generate the mainvoice signal, and controls the sound receiving device 120 d (the secondsound receiving device) to generate the secondary voice signal. Forexample, the main controller 160 controls the sound receiving device 120a (the first sound receiving device) to receive the sound so as togenerate the first voice signal, uses the first voice signal generatedby the sound receiving device 120 a as the main voice signal, and allowsthe generated first voice signal (the main voice signal) to betransmitted to the audio processor 140 a through the switch 180 a andthe line ain. At the same time, the main controller 160 correspondinglycontrols the sound receiving device 120 d (the second sound receivingdevice) to receive the sound so as to generate the second voice signal,uses the second voice signal generated by the sound receiving device 120d as the secondary voice signal, and allows the generated second voicesignal (the secondary voice signal) to be transmitted to the audioprocessor 140 a through the switch 180 d and the line aref.

When the main controller 160 receives the second sensing signal, stepS250 is executed. The main controller 160 controls the sound receivingdevice 120 d (the second sound receiving device) to generate the mainvoice signal, and controls the sound receiving device 120 a (the firstsound receiving device) to generate the secondary voice signal. Forexample, the main controller 160 controls the sound receiving device 120d (the second sound receiving device) to receive the sound so as togenerate the second voice signal, uses the second voice signal generatedby the sound receiving device 120 d as the main voice signal, and allowsthe generated second voice signal (the main voice signal) to betransmitted to the audio processor 140 a through the switch 180 d andthe line ain. At the same time, the main controller 160 correspondinglycontrols the sound receiving device 120 a (the first sound receivingdevice) to receive the sound so as to generate the first voice signal,uses the first voice signal generated by the sound receiving device 120a as the secondary voice signal, and allows the generated first voicesignal (the secondary voice signal) to be transmitted to the audioprocessor 140 a through the switch 180 a and the line aref.

In step S260, the main voice signal and the secondary voice signal areprocessed into an output voice signal through the audio processors 140a-140 c.

To simplify matters, only two switches 180 a and 180 b are depicted, asshown in FIG. 2. Each of the switches 180 a-180 f is constituted by theplurality of transfer switches, and the signals are one-in-one-outsignals. The main controller 160 is used to switch corresponding soundreceiving devices 120 to suitable lines, such as the line ain, the linearef, the lin bin, the line bref, the line cin, or the line cref. Forexample, when the sound receiving device 120 a receives the main voicesignal and the sound receiving device 120 d receives the secondary voicesignal, the main controller 160 switches the switch 180 a to the lineain and switches the switch 180 d to the line aref. The audio processor140 a receives the main voice signal of the line ain and the secondaryvoice signal of the line aref. In addition, the audio processor 140 aprocesses the main voice signal and the secondary voice signal into theoutput voice signal, and outputs the output voice signal to the maincontroller 160. Processing the main voice signal and the secondary voicesignal into the output voice signal mentioned above, for example, couldbe that the main voice signal and the secondary voice signal can beprocessed by the environmental noise cancellation (ENC) technology toremove various environmental interference noises in the main voicesignal, such as speech of other people, noises generated by vehicles,wind noises, and the like, through subtracting the secondary voicesignal from the main voice signal so as to obtain the output voicesignal.

A description is provided with reference to FIG. 5. FIG. 5 depicts aflowchart of a speech processing method according to another embodimentof the present disclosure. In order to facilitate the understanding ofthe speech processing method shown in FIG. 5, the description is alsoprovided with reference to FIG. 2 and FIG. 3. Those skilled in the artwould understand that the speech processing method according to thepresent embodiment is not limited to being applied to the speechprocessing system 100 of FIG. 2 and FIG. 3, and is not limited to thesequence of steps of the flowchart shown in FIG. 5. A method 300includes step S211, step S220, step S230, step S240, step S250, and steps260.

In step S211, a sensing result of the sensing device 122 a in the soundreceiving device 120 a and the sensing device 122 d in the soundreceiving device 120 d is a result of sensing whether an objectapproaches. When a user approaches the sound receiving device 120 a, thesensing device 122 a in the sound receiving device 120 a senses that anobject approaches, and outputs the first sensing signal. Similarly, whenthe user approaches the sound receiving device 120 d, the sensing device122 d in the sound receiving device 120 d senses that the objectapproaches, and outputs the second sensing signal. If the sensing device122 a and the sensing device 122 d do not sense any object, the sensingis continued. Since step S220 to step S260 are the same as thosedescribed in FIG. 4, a description in this regard is not provided.

In one embodiment, the speech processing system 100 includes a processor190. After the main controller 160 receives the output voice signal, theoutput voice signal can be transmitted to the processor 190. Theprocessor 190 can receive a plurality of output voice signals processedby the audio processor 140 a, the audio processor 140 b, and the audioprocessor 140 c, and perform subsequent processes. The processor 190 maybe a computer or a mobile phone, and is connected to the main controller160 through Universal Serial Bus (USB) or Bluetooth, but the disclosureis not limited in this regard.

For example, three users A, B, and C simultaneously use the speechprocessing system 100, and are located in front of different sensingdevices. The user A is located in front of the sensing device 122 a. Theuser B is located in front of the sensing device 122 b. The user C islocated in front of the sensing device 122 c. A voice signal of the userA can be received and processed by the sound receiving devices 120 a,120 d and the audio processor 140 a. A voice signal of the user B can bereceived and processed by the sound receiving devices 120 b, 120 e andthe audio processor 140 b. A voice signal of the user C can be receivedand processed by the sound receiving devices 120 c, 120 f and the audioprocessor 140 c. The speech processing system 100 uses the three audioprocessors 140 a, 140 b, and 140 c to process the voice signals of thethree users A, B and C and the results are output to the main controller160. The main controller 160 can simultaneously receive output voicesignals of the three audio processors and transport the output voicesignals on channel 1, channel 2, and channel 3 respectively. When it isnecessary to switch or select different channels, the main controller160 or the processor 190 can perform switching or selection throughinternal switches (not illustrated in the figures).

In the present embodiment, the main controller 160 may be connected to acomputer through, for example, USB or wireless communication. Thecomputer can simultaneously process the voice signals of the users A, B,and C, and can further display the voice signals of the users A, B, andC respectively on the channel 1, the channel 2, and the channel 3through a software on the computer. If a fourth user D wants to talk tothe user A through a software of a mobile phone or a computer, the userD can select the appropriate channel by using the software to talk tothe user A. If the voice signal of the user A is on the channel 1, theuser D can select, by using the software, to only play the voice signalon the channel 1. The main controller 160 or the processor 190 canswitch to the channel 1 through the internal switch and output the voicesignal of the channel 1 to avoid interferences from other channels. Inaddition, the main controller 160 or the processor 190 can further turnoff the unselected channels 2 and 3 according to the user's operation toavoid interferences from other channels that are not selected.

In one embodiment, each of the sound receiving devices 120 a-120 fincludes an amplifier respectively (not illustrated in the figures). Theamplifiers are configured to enhance or attenuate the voice signalsgenerated by the sound receiving devices 120 a-120 f according to amagnification. In the embodiment, the sensing devices 122 a and 122 d inthe sound receiving device 120 a and the sound receiving device 120 dinclude distance sensing devices. The distance sensing devices areconfigured to sense distances of an object (such as a human body) thatapproaches the sound receiving device 120 a and the sound receivingdevice 120 d from the distance sensing devices, i.e. an object distance.When an object distance between the object and the distance sensingdevice is less than a reference distance, it means that the user iscloser to the sound receiving device 120 a or 120 d. The sensing device122 a or 122 d corresponding to the sound receiving device 120 a or 120d transmits a lowering signal to the main controller 160 to lower themagnification of the amplifier in the corresponding sound receivingdevice 120 a or sound receiving device 120 d so as to reduce the volumeof sound received. When the object distance between the object and thedistance sensing device is greater than a reference distance, it meansthat the user is farther away from the sound receiving device 120 a or120 d. The sensing device 122 a or 122 d corresponding to the soundreceiving device 120 a or 120 d transmits an enhancing signal to themain controller 160 to enhance the magnification of the amplifier in thecorresponding sound receiving device 120 a or sound receiving device 120d so as to increase the volume of sound received.

In one embodiment, the sensing devices 122 a and 122 d in the soundreceiving device 120 a and the sound receiving device 120 d includevolume sensing devices. The volume sensing devices are configured tosense voice volumes. When the voice volume is higher than a referencevolume, it means that the user is closer to the sound receiving device120 a or 120 d. The sensing device 122 a or 122 d corresponding to thesound receiving device 120 a or 120 d transmits the lowering signal tothe main controller 160 to lower the magnification of the amplifier inthe corresponding sound receiving device 120 a or sound receiving device120 d so as to reduce the volume of sound received. When the voicevolume is lower than the reference volume, it means that the user isfarther away from the sound receiving device 120 a or 120 d. The sensingdevice 122 a or 122 d corresponding to the sound receiving device 120 aor 120 d transmits the enhancing signal to the main controller 160 toenhance the magnification of the amplifier in the corresponding soundreceiving device 120 a or sound receiving device 120 d so as to increasethe volume of sound received.

The sensing devices 122 a-122 f can set different sensing methodsdepending on practical requirements. For example, the sensing devices122 a-122 f can be set to send the sensing signals after sensing theproximity of the object and the sound. A description is provided withreference to FIG. 6. FIG. 6 depicts a flowchart of a speech processingmethod according to still another embodiment of the present disclosure.In order to facilitate the understanding of the speech processing methodshown in FIG. 6, the description is also provided with reference to FIG.2 and FIG. 3. Those skilled in the art would understand that the speechprocessing method according to the present embodiment is not limited tobeing applied to the speech processing system 100 of FIG. 2 and FIG. 3,and is not limited to the sequence of steps of the flowchart shown inFIG. 6.

A method 400 includes step S212, step S213, step S220, step S230, stepS240, step S250, and step S260. In step S212, if the user is located infront of the sensing device 122 a, the sensing device 122 a senses thatthe user (object) approaches. At this time, the first sensing signal isnot sent yet. The method of sensing that the user (object) approachesmay be, for example, the above temperature sensing method or distancesensing method, etc., but the present disclosure is not limited in thisregard. After the object is sensed, step S213 is executed. When the userstarts speaking, the sensing device 122 a senses the voice volume anddetermines whether it is higher than or equal to a reference volume. Thereference volume may be set to 60 decibels that is the general humanspeaking volume, and the first sensing signal is sent to the maincontroller 160 when the voice volume is higher than or equal to thereference volume. Since step S220 to step S260 are the same as thosedescribed in FIG. 4, a description in this regard is not provided. Onoccasions where the number of users is large, the method of settingdouble conditions according to the present embodiment can be utilized toimprove the accuracy of judgment.

A number of the sound receiving devices in the speech processing system100 may be adjusted depending on practical applications, and a number ofthe audio processors may also be adjusted depending on the number of thesound receiving devices. In the embodiments of the present disclosure,the six sound receiving devices 120 a-120 f are taken as an example sothe three audio processors 140 a-140 c are correspondingly disposed. Inother words, the speech processing system 100 can process three sets ofvoice signals.

In summary, the speech processing system uses the respective sensingdevices of the sound receiving devices to determine whether the userapproaches or whether the user wants to use the sound receiving device.When the sound receiving devices are turned on, the sensing devicessense the distances or volumes of the user so that the volume of soundreceived can be automatically adjusted. The audio processor processesthe main voice signal and the secondary voice signal into the outputvoice signal. The user can select the desired output voice signaldepending on practical use conditions.

Although the present invention has been described in considerable detailwith reference to certain embodiments thereof, other embodiments arepossible. Therefore, the spirit and scope of the appended claims shouldnot be limited to the description of the embodiments contained herein.

It will be apparent to those skilled in the art that variousmodifications and variations can be made to the structure of the presentinvention without departing from the scope or spirit of the invention.In view of the foregoing, it is intended that the present inventioncover modifications and variations of this invention provided they fallwithin the scope of the following claims and their equivalents.

What is claimed is:
 1. A speech processing system, comprising: a firstsound receiving device configured to generate a first voice signal,wherein the first sound receiving device comprises: a first sensingdevice configured to output a first sensing signal in condition that auser approaches the first sound receiving device; a second soundreceiving device disposed corresponding to the first sound receivingdevice and configured to generate a second voice signal, wherein thesecond sound receiving device comprises: a second sensing deviceconfigured to output a second sensing signal in condition that the userapproaches the second sound receiving device; a main controller usingthe first voice signal as a main voice signal and using the second voicesignal as a secondary voice signal when receiving the first sensingsignal; the main controller using the second voice signal as the mainvoice signal and using the first voice signal as the secondary voicesignal when receiving the second sensing signal; and an audio processorconfigured to process the main voice signal and the secondary voicesignal into an output voice signal; wherein each of the first soundreceiving device and the second sound receiving device furthercomprises: an amplifier configured to enhance or attenuate the firstvoice signal or the second voice signal according to a magnification;wherein when a first object distance sensed by the first sensing deviceis less than a reference distance, the first sensing device transmits afirst lowering signal to the main controller to lower the magnificationof the amplifier in the first sound receiving device, and when the firstobject distance is larger than the reference distance, the first sensingdevice transmits a first enhancing signal to the main controller toenhance the magnification of the amplifier in the first sound receivingdevice; and wherein when a second object distance sensed by the secondsensing device is less than the reference distance, the second sensingdevice transmits a second lowering signal to the main controller tolower the magnification of the amplifier in the second sound receivingdevice, and when the second object distance is larger than the referencedistance, the second sensing device transmits a second enhancing signalto the main controller to enhance the magnification of the amplifier inthe second sound receiving device.
 2. The speech processing system ofclaim 1, wherein when a first temperature sensed by the first sensingdevice is higher than a reference temperature, the first sensing deviceoutputs the first sensing signal to the main controller so as to controlthe first sound receiving device to generate the main voice signal andcontrol the second sound receiving device to generate the secondaryvoice signal, and when a second temperature sensed by the second sensingdevice is higher than the reference temperature, the second sensingdevice outputs the second sensing signal to the main controller so as tocontrol the second sound receiving device to generate the main voicesignal and control the first sound receiving device to generate thesecondary voice signal.
 3. The speech processing system of claim 1,wherein when a first voice volume sensed by the first sensing device ishigher than a reference volume, the first sensing device transmits thefirst lowering signal to the main controller to lower the magnificationof the amplifier in the first sound receiving device, and when the firstvoice volume sensed by the first sensing device is lower than thereference volume, the first sensing device transmits the first enhancingsignal to the main controller to enhance the magnification of theamplifier in the first sound receiving device; and wherein when a secondvoice volume sensed by the second sensing device is higher than thereference volume, the second sensing device transmits the secondlowering signal to the main controller to lower the magnification of theamplifier in the second sound receiving device, and when the secondvoice volume sensed by the second sensing device is lower than thereference volume, the second sensing device transmits the secondenhancing signal to the main controller to enhance the magnification ofthe amplifier in the second sound receiving device.
 4. The speechprocessing system of claim 1, further comprising: a first switch; and asecond switch; wherein when the main controller receives the firstsensing signal, the main controller switches the first switch to use thefirst voice signal as the main voice signal and switches the secondswitch to use the second voice signal as the secondary voice signal, andwhen the main controller receives the second sensing signal, the maincontroller switches the first switch to use the first voice signal asthe secondary voice signal and switches the second switch to use thesecond voice signal as the main voice signal.
 5. The speech processingsystem of claim 1, further comprising: a processor configured to receivethe output voice signal from one of a plurality of channels, whereinwhen the processor selects one of the channels, unselected channelsother than the one of the channels are turned off.
 6. The speechprocessing system of claim 1, wherein the second sound receiving deviceis disposed opposite to the first sound receiving device.
 7. The speechprocessing system of claim 1, wherein the audio processor is furtherconfigured to obtain the output voice signal through subtracting thesecondary voice signal from the main voice signal.
 8. A speechprocessing method, comprising: outputting a first sensing signal to amain controller in condition that a user approaches a first sensingdevice in a first sound receiving device; outputting a second sensingsignal to the main controller in condition that the user approaches asecond sensing device in a second sound receiving device; controllingswitching states of a first switch and a second switch to couple thefirst sound receiving device and the second sound receiving device to anaudio processor; controlling the first sound receiving device togenerate a main voice signal and controlling the second sound receivingdevice to generate a secondary voice signal when receiving the firstsensing signal; controlling the second sound receiving device togenerate the main voice signal and controlling the first sound receivingdevice to generate the secondary voice signal when receiving the secondsensing signal; and processing the main voice signal and the secondaryvoice signal into an output voice signal through the audio processor;enhancing or attenuating the main voice signal or the secondary voicesignal generated by the first sound receiving device or the second soundreceiving device through an amplifier; wherein when a first objectdistance sensed by the first sensing device is less than a referencedistance, transmitting, by the first sensing device, a first loweringsignal to the main controller, and when the first object distance sensedby the first sensing device is greater than the reference distance,transmitting, by the first sensing device, a first enhancing signal tothe main controller; wherein when a second object distance sensed by thesecond sensing device is less than the reference distance, transmitting,by the second sensing device, a second lowering signal to the maincontroller, and when the second object distance sensed by the secondsensing device is greater than the reference distance, transmitting, bythe second sensing device, a second enhancing signal to the maincontroller.
 9. The speech processing method of claim 8, wherein when afirst temperature sensed by the first sensing device is higher than areference temperature, the first sensing device outputs the firstsensing signal to the main controller so as to control the first soundreceiving device to generate the main voice signal and control thesecond sound receiving device to generate the secondary voice signal,and when a second temperature sensed by the second sensing device ishigher than the reference temperature, the second sensing device outputsthe second sensing signal to the main controller so as to control thesecond sound receiving device to generate the main voice signal andcontrol the first sound receiving device to generate the secondary voicesignal.
 10. The speech processing method of claim 9, wherein when thefirst temperature sensed by the first sensing device is higher than thereference temperature and a voice volume sensed by the first sensingdevice is higher than or equal to a reference volume, the first sensingdevice outputs the first sensing signal to the main controller so as tocontrol the first sound receiving device to generate the main voicesignal and control the second sound receiving device to generate thesecondary voice signal, and when the second temperature sensed by thesecond sensing device is higher than the reference temperature and thevoice volume sensed by the second sensing device is higher than or equalto the reference volume, the second sensing device outputs the secondsensing signal to the main controller so as to control the second soundreceiving device to generate the main voice signal and control the firstsound receiving device to generate the secondary voice signal.
 11. Thespeech processing method of claim 8, wherein when a first voice volumesensed by the first sensing device is higher than a reference volume,transmitting, by the first sensing device, the first lowering signal tothe main controller, and when the first voice volume sensed by the firstsensing device is lower than the reference volume, transmitting, by thefirst sensing device, the first enhancing signal to the main controller;wherein when a second voice volume sensed by the second sensing deviceis higher than the reference volume, transmitting, by the second sensingdevice, the second lowering signal to the main controller, and when thesecond voice volume sensed by the second sensing device is lower thanthe reference volume, transmitting, by the second sensing device, thesecond enhancing signal to the main controller.
 12. The speechprocessing method of claim 8, wherein the step of controlling the firstsound receiving device to generate the main voice signal and controllingthe second sound receiving device to generate the secondary voice signalwhen receiving the first sensing signal comprises: switching the firstswitch to use a first voice signal generated by the first soundreceiving device as the main voice signal, and switching the secondswitch to use a second voice signal generated by the second soundreceiving device as the secondary voice signal through the first switchand the second switch.
 13. The speech processing method of claim 12,wherein the step of controlling the second sound receiving device togenerate the main voice signal and controlling the first sound receivingdevice to generate the secondary voice signal when receiving the secondsensing signal further comprises: switching the first switch to use thefirst voice signal as the secondary voice signal, and switching thesecond switch to use the second voice signal as the main voice signalthrough the first switch and the second switch.
 14. The speechprocessing method of claim 8, further comprising: receiving the outputvoice signal from one of a plurality of channels through a processor,wherein when the processor selects one of the channels, unselectedchannels other than the one of the channels are turned off.